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ABSTRACT 


■ 


Sequential  designs  are  proposed  for  clinical  trials  to  compare 
two  binomial  success  probabilities,  p^  and  P2>  Approximations  to  the 
operating  characteristics  and  expected  sample  size  are  obtained  and 
compared  with  simulations.  Special  reference  is  made  to  the  problem  of 
comparing  vasopressin  and  placebo  for  stopping  upper  gastrointestinal 
hemorrhage . 
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A SEQUENTIAL  CLINICAL  TRIAL  FOR  TESTING  p1  = p2 
1.  Introduction 

This  paper  presents  a class  of  sequential  tests  for  comparing 
two  binomial  success  probabilities  p^  and  p2>  Although  the  method  is  a 
general  one,  it  was  motivated  by  a clinical  trial  for  testing  the  effi- 
cacy of  vasopressin  (a  hormone  which  constricts  blood  vessels)  in 
stopping  upper  gastrointestinal  hemorrhage;  and  although  this  report  is 
primarily  theoretical,  reference  will  be  made  to  the  trial  of  vaso- 
pressin because  it  seems  to  illustrate  clearly  certain  advantages  and 
disadvantages  of  a sequential  design  in  clinical  trials. 

Two  conditions  indicating  a sequential  design  are  (i)  a 
serious  disease,  so  that  ethical  considerations  mandate  the  early  ter- 
mination of  a trial  in  which  one  treatment  appears  especially 
effective,  and  (ii)  a response  time  which  is  short  compared  to  the 
time  between  patient  arrivals,  so  that  it  is  feasible  to  evaluate  the 
current  state  of  affairs  before  admitting  new  patients  to  the  study. 
Massive  upper  gastrointestinal  hemorrhage  satisfies  these  requirements, 
since  failure  to  control  it  within  hours  may  lead  to  death  or  to  surgi- 
cal intervention. 

One  other  circumstance  which  seems  to  indicate  a sequential 
trial  in  this  particular  case  is  the  existence  of  earlier,  favorable 
reports  on  the  use  of  vasopressin  for  stopping  upper  gastrointestinal 
hemorrhage.  (See  especially  Conn  et  al.,  1975.) 
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Although  certain  reservations  concerning  these  earlier  trials  and  the 
desire  to  investigate  a much  simpler  mode  of  administration  of  the  drug 
suggest  a new  trial,  a sequential  design  provides  protection  against 
the  lengthy  continuation  of  this  trial,  should  the  previous,  favorable 
results  be  repeated. 

For  purposes  of  sequential  analysis  this  trial  is  one  of  com- 
paring the  probability  p^  of  success  using  vasopressin  to  the  probabil- 
ity p£  of  success  with  placebo.  Success  is  defined  as  a cessation  of 
bleeding  within  five  hours  and  no  recurrence  within  six.  Other  end- 
points of  interest  are  the  time  until  bleeding  initially  ceases,  recur- 
rence of  bleeding,  severity  of  bleeding  measured  by  transfusion 
requirements,  the  need  for  surgical  intervention,  and  death.  Choice  of 
a sequential  design  for  testing  p^  = p^  makes  the  implicit  assumption 
that  if  p^  appears  to  be  considerably  larger  than  p^,  that  by  itself  is 
sufficient  to  terminate  the  trial  and  indicate  the  use  of  vasopressin. 

In  practice  one  would  probably  be  reluctant  to  terminate  early  unless  the 
other  factors  also  consistently  favor  vasopressin,  although  it  would 
defeat  the  purpose  of  a sequential  trial  to  insist  that  these  factors 
show  "statistically  significant"  differences  between  treatment  and 
control.  Conversely,  in  the  absence  of  a strong  indication  for  vaso- 
pressin based  on  success  rate  alone,  it  may  be  desirable  to  analyze 
other  factors  rather  carefully.  The  extent  to  which  a sequential 
design  introduces  a bias  which  makes  these  other  analyses  difficult  is 
perhaps  its  most  serious  disadvantage.  (This  point  will  be  discussed 
again  in  Section  2.) 
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In  addition  to  studying  the  specific  problem  of  testing 
p^  - P2»  a primary  goal  of  this  paper  is  to  indicate  how  arguments 
developed  by  Siegmund  (1977,  1978)  to  deal  with  normally  distributed 
data  can  be  adapted  and  supplemented  by  simulations  to  obtain  a reason- 
ably clear  understanding  of  a similar  but  more  difficult  problem. 
Section  2 reviews  pertinent  material  for  an  analogous  problem  with 
normal  data.  A modification  of  the  repeated  significance  tests  advo- 
cated by  Armitage  (1975)  is  suggested  and  their  properties  studied. 
Section  3 returns  to  the  problem  of  testing  p^  = p^-  Mathematical 
results  are  collected  in  three  appendices. 
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2.  Normal  Data  With  Known  Variance 


The  normal  distribution  with  known  variance  is  relatively 
simple  both  conceptually  and  technically  and  suggests  useful  approxima- 
tions for  more  complex  situations.  In  this  section  known  results  for 
the  repeated  significance  tests  advocated  by  Armitage  (e.g.,  Armitage, 
1975)  are  reviewed  and  a modification  of  these  tests  suggested  and 
studied. 

The  simplest  situation  occurs  in  a paired  comparison  design,  in 
which  for  each  n = 1,2,...  the  observation  x^  represents  the  differ- 
ence in  response  of  the  nth  pair  of  subjects,  one  of  whom  receives 

treatment  A and  the  other  treatment  B.  It  is  assumed  that  the  x are 

n 

independent  and  normally  distributed  with  expectation  y and  known  vari- 
2 

ance  a . Let  s = x,  + . . . + x , and  given  b,  > 0 and  = 1,2,... 
n 1 n ° 1 0 

define 

(1)  T,  = first  n > m~  such  that  Is  I > b.a  n*5 

1 — 0 1 n‘  1 

Let  m^  > mjj  be  a positive  integer.  The  sequential  test  of  : y = 0 
against  : y + 0 which  terminates  sampling  at  min(T^,m^)  and  rejects 
Hq  if  and  only  if  T^  £ m^  is  the  repeated  significance  test  of 
Armitage  (1975) . 

Let  9 * y/a.  The  distribution  of  T^  and  hence  the  power  func- 
tion of  this  test  depend  on  y and  a only  through  the  value  of  9.  By 
repeated  numerical  integration  McPherson  and  Armitage  (1971) — see  also 
Armitage  (1975) — have  provided  tables  which  allow  one  to  choose  the 


design  parameters  and  b..  to  attain  a specified  significance  level 


a = P^T^m^}  and  power  1-8  = Pg  {T^£m^}  at  a given  value  0^  ^ 0. 
Accurate  analytic  approximations  to  a and  8 were  given  by  Siegmund 
(1977,  1978) — see  Appendix  A for  a summary  of  the  pertinent  results 
adapted  to  the  present  requirements. 

Consider  a repeated  significance  test  and  a fixed  sample  size 
test  of  the  same  significance  level  and  power  at  some  given  0^  f 0. 

The  advantage  of  the  sequential  test  is  that  if  | 0 | is  large,  indica- 
ting that  one  treatment  is  considerably  superior,  the  expected  sample 
size  of  the  sequential  test  is  much  smaller  than  the  fixed  sample  size. 
Concomitant  disadvantages  are  that  the  sequential  test  has  a consider- 
ably larger  maximum  sample  size  and  less  power  for  detecting  values  0 
closer  to  zero  than  0^. 

A class  of  modified  repeated  significance  tests  which  interpo- 
late the  fixed  sample  size  and  repeated  significance  tests  have  been 
suggested  independently  by  Peto  et  al.  (1976)  and  Siegmund  (1978),  but 
their  properties  have  not  been  studied.  Let  0 < c b and  < o be 
given,  and  let  T be  defined  by  (1)  with  b in  place  of  b^.  Stop  samp- 
ling at  min(T,m)  and  reject  Hq  if  either  T m or  T > m and  1 | > cm^. 
For  fixed  m^  there  are  three  parameters  m,  b,  and  c defining  such  a 
modified  repeated  significance  test  and  hence  there  are  many  tests 
having  a specified  significance  level  and  power  at  a given  0^  / 0. 
Relative  to  a given  repeated  significance  test  defined  by  m^  and  b^, 
the  corresponding  modified  tests  have  m <_  m^  and  b b^.  The  extreme 
case  b = 00  corresponds  to  a fixed  sample  size  test  with  rejection 
region  |sm|  > cm^,  whereas  c = b = b^  and  m = m^  give  a repeated 
significance  test.  Figure  1 illustrates  these  relations. 
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Fixed  sample  size 


Modified  repeated 
significance  test 


Repeated  significance 


Table  1 gives  numerical  examples  Illustrating  various  relations 
among  fixed  sample  size,  repeated  significance  tests,  and  the  modified 
tests  suggested  here.  The  approximations  given  in  Appendices  A and  B 
were  used  to  perform  the  required  calculations,  except  for  those 
entries  which  could  be  obtained  from  Armitage  (1975),  p.  104. 
(Simulations  indicate  that  these  approximations  are  quite  accurate.) 

For  all  tests  the  over-all  significance  level  is  .05  and  m^  = 1.  The 
maximum  sample  sizes  were  taken  to  be  equal  and  large  enough  to  include 
the  apparent  range  of  possibilities  for  the  proposed  vasopressin  study. 
The  entry  m^  denotes  that  fixed  sample  size  which  would  yield  the  same 
power  at  the  indicated  0 as  the  given  sequential  test.  One  measure  of 
the  savings  in  sample  size  for  the  sequential  tests  compared  to  fixed 
sample  size  tests  is  R = (m^  - expected  sample  size)/m^.  For  the  modi- 
fied tests  b was  chosen  fairly  large,  so  that  Pq{t_<  m)  is  slightly  less 
than  .02.  The  entry  1-g*  denotes  the  probability  at  0 of  early 
termination,  Pg{T£m}. 

A reasonable  over-all  conclusion  seems  to  be  that  for  about  a 
10%  increase  in  the  maximum  sample  size  of  a fixed  sample  test,  a modi- 
fied sequential  test  can  yield  about  the  same  power  and  a 40%  reduction 
in  the  expected  sample  size  for  | 0 | large  enough  that  the  power 
approaches  one.  Comparatively,  a repeated  significance  test  obtains  a 
greater  reduction  in  expected  sample  size  for  large  | 0 | at  the  cost  of 
a considerable  increase  in  the  maximum  sample  size  and  some  loss  of 
power.  The  relative  advantages  and  disadvantages  of  the  repeated  sig- 
nificance tests  compared  to  the  modified  tests  become  more  pronounced 
with  larger  patient  horizons. 


I I 
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The  most  obvious  appeal  of  these  modified  repeated  significance 
tests  is  that  they  provide  insurance  against  a long  trial  should  one 
treatment  seem  considerav1y  superior  without  as  large  a maximum  sample 
size  or  as  great  a loss  of  power  to  detect  smaller  differences  as  the 
usual  repeated  significance  tests. 

Although  difficult  to  quantify,  the  following  additional 
arguments  in  favor  of  the  modified  tests  seem  to  warrant  some 
discussion. 

(i)  One  disadvantage  of  the  possible  early  termination  of  a 
sequential  test  is  that  it  may  prevent  the  accumulation  of  sufficient 
evidence  against  to  be  thoroughly  convincing.  For  a repeated 
significance  test,  if  = n < m^,  the  observed  significance  level  or 
P-value  of  the  test  may  be  defined  as  PQ{T^£n}.  This  is  a simple 
index  of  how  convincing  the  data  against  Hq  are;  but  since  P^fT^^n}  is 
approximately  proportional  to  log  n (Siegmund,  1977,  or  Appendix  A), 
even  for  n much  smaller  than  m^  it  may  not  be  appreciably  smaller  than 
the  over-all  significance  level,  Pg(T^£m^}.  For  example,  for  the 
first  repeated  significance  test  in  Table  1,  which  has  m^  = 49  and 
a = .05,  if  T1  = 16,  the  observed  significance  level  is 
P^T^lb}  = .032.  By  way  of  contrast,  for  a modified  test  the  over- 
all significance  level  is 

, k 

(2)  a = P-{T  < m)  + P fT  > m,  CTn5  < | s | < bin5}  . 

If  T = n <_  m,  the  observed  significance  Pq{T<^}  is  no  greater  than 
P^T^m),  which  may  be  made  small  by  taking  b large.  For  the  first 
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modified  test  in  Table  1,  Pq{T£49}  = .018  and  Pq{t^16}  = .011.  A 
similar  aigument  applies  to  other  indices  of  "convincingness,"  e.g. , a 
lower  confidence  bound  on  | 0 | (cf.  Siegmund,  1978). 

(ii)  In  many  cases  the  parameter  0 represents  one  direction  in  a 
multidimensional  space.  Implicit  in  the  use  of  a sequential  test  is 
the  assumption  that  if  | 0 | appears  to  be  large,  that  by  itself  will 
play  a dominant  role  in  the  choice  of  treatment  (although  one  would 
presumably  look  at  other  factors  to  see  that  they  are  reasonably  con- 
sistent with  this  conclusion).  However,  if  ] 0 ] is  small,  a more 
detailed  analysis  of  these  other  factors  may  be  important.  For 
example,  if  vasopressin  reduces  bleeding  sufficiently  that  surgery  need 
not  be  performed  on  an  emergency  basis,  it  would  be  a useful  treatment, 
even  though  its  "success"  rate  may  be  no  higher  than  for  placebo. 

Use  of  a data  dependent  stopping  rule  introduces  a bias  which 
tends  to  make  ] 0 | seem  larger  than  it  actually  is.  (See  Siegmund, 

1978,  for  a discussion  of  the  magnitude  of  this  bias.)  By  using  a 
modified  test  with  b large  enough  that  Pq{T£iii}  is  relatively  small  for 
small  | 0 | , one  reduces  the  biasing  effect  of  the  stopping  rule, 
particularly  for  small  1 0 I when  analysis  of  other  variables  may  play  an 
important  role. 
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3.  Comparing  Two  Binomials 

The  clinical  trial  of  vasopressin  may  be  regarded  as  one  of 
comparing  two  binomial  success  probabilities  p^  and  p2,  where  p^  is  the 
probability  of  success  with  vasopressin  and  p2  the  probability  of 
success  with  placebo.  There  are  possibilities  for  stratifying  the 
population,  e.g.,  according  to  the  cause  of  bleeding,  but  initially  it 
is  easier  to  suppose  that  all  data  are  pooled. 

To  simplify  the  discussion  it  is  assumed  that  observations  are 
taken  in  pairs  with  one  member  of  each  pair  assigned  to  treatment  and 
the  other  to  control.  The  biased  coin  design  of  Efron  (1971)  pro- 
vides a reasonable  scheme  for  approximating  this  situation  while  main- 
taining a high  level  of  unpredictability  as  to  exactly  which  treatment 
will  be  assigned  to  the  next  patient.  For  the  first  two  tests  dis- 
cussed below  patients  may  easily  be  assigned  to  treatment  or  control  in 
a 2 to  1 or  other  ratio.  The  matched  pairs  test  described  later 
relies  heavily  on  approximately  balanced  allocation  to  treatment  and 
control. 

For  theoretical  discussions  of  clinical  trials  it  is  customary 
to  consider  testing  p^  = p2  against  the  two-sided  alternative  p^  ^ p2, 
and  that  custom  is  followed  here.  For  a comparison  of  treatment  and 
placebo  it  is  sometimes  appropriate  to  consider  only  the  one-sided 
alternative  p^  > p2«  The  appropriate  modifications  are  obvious,  and 
for  the  numerical  examples  presented  later  give  tests  having  a signifi- 
cance level  about  one-half  that  of  the  two-sided  tests. 
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Assume  then  that  the  data  consists  of  pairs  (x^.y^), 
(x2»y2)»***»  where  the  x's  and  y's  are  independent  random  variables 
assuming  the  values  1 and  0.  Let  P{xn = l}  = p^,  P{yR * l)  = 

= 1 - p^  and  = 1 - P2*  It  is  desired  to  obtain  a sequential  test 
of  Hq  : p^  = P2  against  : p^  i P2  which  on  the  average  requires  a 
small  number  of  observations  to  reach  a decision  whenever  p^  and  p ^ 
differ  substantially. 

Two  obvious  candidates  are  sequential  versions  of  the  gener- 

9 

alized  likelihood  ratio  test  and  of  the  x test  for  independence  in 
2x2  tables.  As  one  might  expect,  these  tests  perform  similarly;  but 
there  are  slight  differences  which  may  be  important  in  some 
applications. 

Let  H(x)  = x log  x + (1  - x)log(l  - x)  and 

I(x,y)  = H(x)  + H(y)  - 2H  [-^(x+y)].  The  log  generalized  likelihood 

ratio  for  testing  Hq  : p^  = P2  against  H^  : p^  ^ P2  based  on  n pairs  of 

observations  is  Z = nl(x  , y ),  where  x = n ^ E?  x,  and 
n n n n lk 

y^  = n z”  y^.  In  analogy  with  (1),  given  integers  m^  £ m and  real 
numbers  0 < c £ b,  define 

(4)  T = first  n > m_  such  that  (2  £ > b 

— U n 

Stop  at  min(T,m)  and  reject  Hq  if  either  T £ m or  T > m and  (2^)^  > c. 

4 

A Taylor  series  expansion  about  (p^,P2)  shows  that  (2n£^) 
behaves  approximately  like  the  absolute  value  of  a sum  of  n independent 
identically  distributed  random  variables  with  mean 

(5)  y = [2l(p1,p2) 

12 


and  variance 


(6)  ai  = (p^  log2(p1q/q1p)  + p2q2  logZ(p2q/qlP)}/2  Kp^Pj)  , 

where  p = (p^  + p2)/2  and  q = 1 - p.  This  suggests  that  under 

Hj.  : t p2>  one  may  approximate  the  behavior  of  (2n£  )**  by  Brownian 

motion  with  drift  y and  variance  a for  the  purpose  of  computing  the 

power  and  expected  sample  size  of  the  test.  The  relevant  formulas  are 

summarized  in  Appendices  A and  B.  Table  2 below  indicates  that  these 

approximations  are  reasonably  accurate. 

Approximating  the  significance  level  of  this  test  poses  a more 

difficult  problem.  Let  P.  . denote  the  probability  measure  on 

(P1»P2) 

sequences  (x^y^,  (x2>y2),...,  determined  by  p.^  and  p2>  The  signifi- 
cance level  of  the  modified  repeated  significance  test  defined  above  is 
for  P;L  = p2  = p 

(7)  a(p)  = P,  . {t  < m}  + P,  . {T  > m,  (2£  )**  > c}  . 

\P»P/  (P,P)  m 

The  difficulties  arise  with  the  first  term  on  the  right  hand  side  of 
(7).  The  second  term  may  be  approximated  by  the  upper  bound 
P^p  p^(c<  (21^)^ £ b) , which  may  in  turn  be  approximated  by 
2[$>(b)  -$(c)],  since  28,m  is  asymptotically  x with  one  degree  of  free- 
dome  under  Hq.  In  principle  the  methods  of  Lai  and  Siegmund  (1977)  may 
be  adapted  to  give  an  asymptotic  approximation  to  P^  pj{T£m}  as 
b m -►  «>,  and  m^  -►  00  in  such  a way  that  bm  -*•  8^  and  bm^  -*■  9^.  A 

heuristic  sketch  of  the  rather  elaborate  computations  is  given  in 
Appendix  C.  The  resulting  formula  involves  a numerical  integration. 


2, 
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which  was  easy  in  the  case  of  normal  random  variables,  but  in  this  case 
remains  a difficult  unsolved  problem.  A further  "no  overshoot"  approx- 
imation to  this  integral  yields  a crude  but  simple  approximate  upper 
bound:  for  p £ -j. 


(8) 


p(p  p)^T— ~ (ff^pqa)*5  e 3 


[l(5,2p-0C(l-0(2p-lp(2q-l+0] 
(0,2p)n{£ :0Q<  [2I(£,2p-£)  ]**<  0^ 


->s 


dt. 


Table  2 gives  two  numerical  examples,  which  are  roughly  compar- 
able to  the  normal  examples  in  Table  1.  The  first  entry  in  each  cell 
is  a Monte  Carlo  estimate;  the  parenthetical  entries  are  analytic 
approximations.  The  ± figures  are  one  estimated  standard  error.  Where 
no  ± figure  is  given,  the  Monte  Carlo  estimate  is  a relative 
frequency  r,  the  standard  error  of  which  may  be  estimated  by  the  usual 
r(l-r)/N,  where  N is  the  number  of  repetitions  of  the  experiment. 

Except  for  the  probabilities  P,  .{T<m}  and  a(p)  , N = 900.  For  these 

VP,P)  — 

probabilities  N = 5000  and  the  method  of  importance  sampling  mentioned 
in  Appendix  C was  used.  Generally  speaking,  the  analytic  approxima- 
tions are  reasonably  good  except  for  the  null  hypothesis  probabilities 
pj(T£m)  and  a(p) , for  which  they  are  too  large  as  expected.  The 
author  has  performed  other  simulations  and  found  the  approximations  to 
hold  up  over  a wider  range  of  parameter  values  than  those  reported 
here. 

Because  of  the  discreteness  of  the  underlying  data,  the  choice 

of  m~  can  have  a substantial  effect  on  P,  .(T<m).  Taking  m_  about 
0 (p,p)  - 0 

h 

equal  to  m seems  reasonable  and  has  the  additional  desirable  property 


I 


14 


TABLE  2 

SEQUENTIAL  GENERALIZED  LIKELIHOOD  RATIO  TEST 


(pl.p2>* 

P(T<  m) 

P(Reject  H 

0^ 

E(TAm) 

Case  I:  = 7, 

m - 49, 

b * 3.15,  c » 

2.15 

.5,  .5 

.017 

i .001  (.023) 

.045 

± .003  ( 

.053) 

48.5 

± .1 

.7,  .5 

.238 

(.231) 

.474 

( 

.451) 

44.1 

± .4 

.8,  .5 

.629 

(.623) 

.851 

( 

.849) 

35.7 

± .5 

(34.7) 

.4,  .4 

.019 

± .001  (.024) 

.041 

± .002  ( 

.054) 

48.3 

± .1 

.6,  .4 

.208 

(.225) 

.448 

( 

.435) 

44.3 

± .4 

.7,  .4 

.578 

(.571) 

.827 

( 

.816) 

36.5 

± .5 

(37.0) 

00 

<• 

• 

.902 

(.900) 

.983 

( 

.981) 

25.8 

± .4 

(24.9) 

.3,  .3 

.018 

± .001  (.024) 

.046 

± .003  ( 

.054) 

48.3 

± .1 

.7,  .3 

.885 

(.880) 

.979 

( 

.976) 

25.9 

± .4 

(25.8) 

.2,  .2 

.016 

± .001  (.017) 

.046 

i .003  ( 

.047) 

48.4 

± .1 

Case 

II:  mQ  “ 10» 

m = 100,  b ■ 3 

• 2,  c = 

2.15 

.5,  .5 

.018 

± .001  (.021) 

.045 

± .004  ( 

.051) 

98.5 

± .3 

.7,  .5 

.506 

(.510) 

.802 

( 

.775) 

79.0 

± .9 

(82.7) 

.8,  .5 

.948 

(.945) 

.995 

( 

.992) 

45.7 

± .8 

(45.4) 

.4,  .4 

.017 

± .001  (.021) 

.044 

± .004  ( 

.051) 

98.5 

± .3 

.6,  .4 

.479 

(.496) 

.761 

( 

.757) 

79.1 

± .9 

(85.7) 

.7,  .4 

.917 

(.918) 

.988 

( 

.986) 

51.4 

± .9 

(48.4) 

.8,  .4 

.998 

(.998) 

1.00 

(1.00  ) 

28.8 

± .5 

(27.0) 

.3,  .3 

.019 

± .001  (.024) 

.046 

± .004  ( 

.054) 

99.1 

± .3 

.7,  .3 

.998 

(.998) 

1.00 

(1.00  ) 

30.2 

± .6 

(28.3) 

.2,  .2 

.017 

± .001  (.021) 

.035 

± .004  ( 

.051) 

98.9 

± .3 

* 

The  cases  (p^.p^)  “ (.5, .3),  (.6, .3),  (.5, .2),  and  (.6, .2)  are  by 
symmetry  the  same  as  (.7,. 5),  (.7,. 4),  (.8,. 5),  and  (.8,. 4) 
respectively. 
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of  making  ^{T£m}  fairly  constant  as  a function  of  p,  at  least  for 
p not  too  near  0 or  1.  For  this  choice  of  m_,  P.  . (T<m)  also  seems 
to  be  quite  well  approximated  by  the  analogous  probability  for  normal 
random  variables  with  m^  » 1.  Although  there  is  no  apparent  reason  why 
this  should  be  so,  it  seems  to  provide  reasonable  quidelines  for  tenta- 
tively selecting  b and  m subject  to  verification  by  a Monte  Carlo 
experiment. 

2 

The  customary  x statistic  for  testing  p^  = is 

\ = n(Vyn)2/2Pn  V Where  Pn  = I(7n  + V and  qn  = 1 - pn-  A 
sequential  test  analogous  to  that  discussed  above  may  be  defined  by  the 

stopping  rule  (4)  with  in  place  of  (21^).  Again  sampling  termin- 
ates at  min(T,m)  and  is  rejected  if  either  T < i or  T > m and 

X*  > c* 

For  given  b,  c,  m-,  and  m,  simulations  show  that  P,  x{T<m) 
0 (P1»P2^  — 

is  generally  smaller  for  this  test  and  the  expected  sample  size  larger 
than  for  the  likelihood  ratio  tests.  If  b and  c are  decreased  to  make 
P^p  pj(T£m)  and  the  significance  levels  comparable,  there  seems  to  be 
essentially  no  difference  in  the  power  and  expected  sample  size  of  the 
two  tests,  at  least  over  the  range  studied  in  Table  2.  Brownian  motion 
approximations  similar  to  those  of  Table  2 seem  adequate  when  p^  t P2» 
although  in  this  case  the  approximating  random  walk  has  the  slightly 


and  variance 


where 


°1  = plqlf+(pl’p2)  + p2q2f-(pl,p2) 


f±(pi»p2)  ' p q - (px  - “ p2)/4^2/2(p  q)3 


One  slight  advantage  that  the  likelihood  ratio  test  seems  to 
2 

have  over  the  y test  is  that  its  value  of  P.  *{T<m)  may  be  more 

(P.P)  ~ 

nearly  constant  as  p gets  close  to  0 or  1.  Previous  experience  sug- 
gests that  extreme  values  of  p are  not  likely  in  the  case  of  upper 

gastrointestinal  hemorrhage,  and  hence  they  were  not  systematically 

2 

investigated.  Of  course,  the  X statistic  is  easier  to  compute. 
Otherwise,  the  authors'  slight  preference  for  the  likelihood  ratio  test 
is  based  on  the  existence  of  the  primitive  but  interesting  null  hypo- 
thesis theory  which  may  be  developed  for  it  and  which  makes  selection 
of  the  design  parameters  m^,  m,  b,  and  c slightly  easier. 

Yet  a third  test  of  p^  = p£  (one  which  differs  from  the  other 
two  in  several  important  respects)  is  obtained  by  discarding  those 
pairs  for  which  x^  = y^  and  basing  a test  on  the  number  of  times  x^  * 1 
among  those  pairs  for  which  x^  ^ y^.  This  device  was  suggested  by 
Wald  (1947,  p.  107)  and  has  been  advocated  by  Armitage  (1975)  for 
general  use  in  making  sequential  binomial  comparisons.  It  reduces  the 


problem  of  testing  the  equality  of  two  binomial  parameters  p^  and  P2  to 
that  of  testing  whether  a single  binomial  parameter  X equals  where 


Let  n be  the  number  of  pairs  i £ n with  x^  ^ and  let  z 


be 


the  proportion  of  pairs  with  x^  ■ 1 among  those  with  i £ n and  x^  ^ y^. 
The  log  generalized  likelihood  ratio  for  testing  Hq  : X = against 
Hx  : X i ~ is 

I**#  “ [H(z  #)  - H(-|)  ] . 

n n 


In  analogy  with  (4),  given  b,  c,  m^,  and  m let 


(9) 


* first  n^  mg  such  that  (2A^)  > b 

n 


^ || 

Stop  sampling  at  T if  T < m and  otherwise  when  n = m.  Reject  Hq  if 

£ m or  > m and  (21**  ^ > c. 

n 

Although  a direct  numerical  approach  is  easy,  for  present  pur- 
poses it  is  simpler  to  adapt  the  methods  introduced  already  to  compute 
approximately  various  properties  of  these  tests.  For  Brownian  motion 
approximations  to  the  distribution  of  T under  H^,  the  appropriate  mean 
and  variance  are 

(10) 


U2  - (2[H(X)  -H(-l)]}*5 


and 


(11) 


aZ2  - X ( 1 — X)log2 [X/(1-X)]/2[H(X)  -H(|)]  , 


w‘  «re  X - p^q2/(p^q2  + P2CI^)  • A "no  overshoot"  approximation  analogous 
to  (8)  is 

(12)  P(p  p)(T#<m)  < (2a/Tr)'Se'a|  {C(l-C)[H(0-H(-|)]}-is  d^ 

(0,Jj)n{?:e0<{2[H(O-H(is)]}ls<  e1) 
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For  b = 3.15,  c = 2.15,  m = 49,  and  = 8 (it  is  impossible  to  stop 

before  = 8 for  this  value  of  b) , (12)  gives  P.  x{T^<m)  < .020. 

(P>P)  ~ 

Monte  Carlo  estimates  based  on  5000  repetitions  for  this  probability 

and  the  over-all  significance  level  are  respectively  .018  ± .000  and 

.056  ± .002.  For  p^  = about  equal  to  .5,  it  takes  on  average  about 

two  pairs  (x  ,y  ) to  obtain  one  pair  with  x ? y . Thus,  a maximum 

n n n n 

value  m = 49  corresponds  roughly  to  a maximum  value  of  n = 98.  Hence, 
for  p^  = P2  = .5  this  test  is  comparable  to  the  second  test  in  Table  2, 
although  its  over-all  significance  level  is  somewhat  larger.  Table  3 
gives  900  repetition  Monte  Carlo  and  analytic  approximations  for  some 
additional  comparisons  with  the  second  test  in  Table  2.  The  analytic 
approximations  to  p ^{Reject  Hq)  are  obtained  by  a normal  approxi- 

mation with  continuity  correction;  the  other  analytic  results  use 
Brownian  motion  with  mean  and  variance  given  by  (10)  and  (11) . These 
figures  suggest  that  in  terms  of  operating  characteristic  and  expected 
sample  size  there  is  no  strong  reason  for  preferring  one  test  to  the 
other.  If  the  values  of  c were  altered  slightly  to  make  the  over-all 
significance  levels  of  the  matched  pairs  test  and  the  two  population 
test  more  nearly  equal,  the  power  functions  would  tend  to  converge,  and 
the  two  tests  would  appear  to  be  even  more  similar. 

There  are  several  important  differences  between  the  matched 
pairs  test  and  the  likelihood  ratio  test  defined  by  (4).  As  was  noted 
earlier,  the  matched  pairs  test  cannot  be  used  unless  approximately 
equal  numbers  of  patients  are  allocated  to  treatment  and  control, 
whereas  the  likelihood  ratio  test  has  no  such  restriction.  Even  in  the 
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r 


case  of  equal  allocations  to  treatment  and  control  the  matched  pairs 
test  has  the  disadvantage  that  its  maximum  sample  size  is  a random 


1 


variable.  If  N denotes  the  number  of  pairs  (x  ,y  ) which  must  be 

observed  until  n^  = m,  then  N is  the  waiting  time  until  m "successes" 

in  Bernoulli  trials  with  success  probability  6 = Pj^  + p2ql'  Hence» 

2 

E.  .N  = m/6  and  var,  . (N)  = m(l-6)/6  . If  in  the  preceding 
(P1»P2^  (P1»P2^ 

example  p = p =.5,  then  E,  . (N)  = var,  . (N)  = 98.  However, 

if  p..  = p,  = .3,  then  E,  . (N)  = 116.7  and  var,  . (N)  = (12. 7)2, 
i 1 'pl,p2^  (P1»P2) 

so  one  may  with  reasonably  large  probability  attain  49  pairs  having 

x ^ y in  fewer  than  116.7  - 12.7  = 104  or  more  than  116.7  + 12.7  = 130 
n n 

total  pairs.  The  situation  is  worse  for  larger  departures  from  6 = 1/2. 
Without  some  compensating  feature  it  seems  doubtful  that  one  would  pre- 
fer this  test  to  that  determined  by  (4) , except  perhaps  in  those  cases 
when  p^  and  are  expected  to  be  near  0 or  1. 

There  are,  however,  compensating  features,  the  most  important 
of  which  is  the  use  of  the  matched  pairs  test  to  deal  with  stratified 
populations.  By  pairing  observations  within  strata  it  is  often  possi- 
ble in  effect  to  increase  the  value  of  A and  hence  the  sensitivity  of 
the  test.  (See  Armitage,  1975,  especially  p.  86.) 

Upper  gastrointestinal  hemorrhage  has  several  common  causes. 

Very  tentative  figures  from  previous  studies  suggest  that  both  the 
spontaneous  remission  rate  and  the  success  rate  with  vasopressin  vary 
with  cause.  With  some  grouping  of  causes  according  to  these  apparent 
remission  rates,  perhaps  90-95%  of  all  patients  can  be  put  into  three 
strata:  (a)  esophagitis,  gastritis,  Mallory-Weiss  tear;  (b)  various 
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ulcers;  and  (c)  esophageal  varices.  Highly  speculative  success  rates 

with  and  without  vasopressin  may  be  approximately  .7  and  .5  for  (a), 

.5  and  .3  for  (b) , and  .7  and  .3  for  (c) . It  is  also  difficult  to 

guess  the  size  of  the  various  strata.  (For  example,  the  percentage  of 

variceal  patients  is  larger  in  populations  with  a large  proportion  of 

alcoholics — hence,  typically  larger  at  V.  A.  hospitals  than  at  others.) 

But  the  ratio  25:25:50  may  be  a reasonable  approximation.  If  these 

figures  are  correct,  then  in  the  entire  population  p^  = .65  and 

~ so  random  pairing  gives  X = .775.  Matching  pairs  within 

strata  gives  X = .778.  Modest  variations  in  these  assumed  figures 

consistently  yield  differences  in  X of  less  than  .01.  In  general, 

stratification  increases  slightly  the  average  number  of  pairs  required 

to  obtain  a pair  with  x 4 y , which  cancels  to  some  extent  the  small 

n 'n 

advantage  gained  by  increasing  X. 

The  failure  of  stratification  to  produce  a more  dramatic  effect 
is  presumably  due  to  the  fact  that  with  the  maximum  sample  sizes 
envisaged  here,  a treatment  effect  with  an  appreciable  probability  of 
being  detected  must  be  as  large  or  larger  than  differences  among 
strata.  For  larger  experiments  designed  to  detect  smaller  treatment 
effects,  the  value  of  stratification  should  be  re-examined. 

The  preceding  analysis  ignores  the  possibility  of  a time  trend, 
against  which  pairing  of  patients  according  to  their  arrival  time  pro- 
vides protection.  The  history  of  the  treatment  of  upper  gastrointes- 
tinal bleeding  and  the  simple  mode  of  administration  of  vasopressin  in 
this  trial  suggest  that  time  trends  do  not  pose  serious  problems. 
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In  summary,  the  modest  increase  in  sensitivity  of  the  matched 
pairs  test  in  the  range  of  parameters  and  sample  sizes  considered  for 
the  vasopressin  study  does  not  seem  sufficient  compensation  for  the 
random  variation  in  its  maximum  sample  size  and  its  failure  to  deal 
adequately  with  unbalanced  allocations  to  treatment  and  control. 


I 
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APPENDIX  A 


(A. 3) 


Pg{cm^<  | s_  | < bnT5} 


m'  — 


The  first  term  may  be  approximated  using  results  of  Siegmund  (1977). 

w 

If  b -*■  “ and  m 00  in  such  a way  that  b = m20^,  then 


(A.  5) 


, -b2/2 

P0(T<m)  - (2/tt)^  b e /“  [v(x)/x]dx  , 


where 


(A.  6)  v(x)  = 2 exp  [- 2 I , n ^ $(--^x  n^)  ]/x^ 

n=i  z 


The  approximation  (A. 5)  is  very  accurate  when  mg  = 1.  For  larger 

values  of  one  seems  to  obtain  a better  approximation  by  assuming  in 

U 

addition  that  mQ  -*•  » and  b = n»Q0Q*  Then 


PROBABILITY  APPROXIMATIONS  FOR  NORMAL  DATA 


Let  x^,X2,...,  be  independent  normally  distributed  random 

variables  with  mean  0 and  variance  1.  Let  s ■ x,  + ... +x  and 

n 1 n 


(A.l) 


T * first  n > m_  such  that  Is  > bn^ 
— 0 1 n 


Let  m > titg  and  0 < c b.  The  over-all  significance  level  of  the  modi- 
fied repeated  significance  test  studied  in  Section  2 is 

(A. 2)  a = P.(t  < m)  + Pa(t  > m,  | s | > cm*5} 

0 — 0 1 m' 

An  upper  bound  for  the  second  term  which  is  fairly  accurate  when  c is 
small  compared  to  b is 
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(A. 7)  Pr.{T<m}  = P_<  ! s | > bin5}  +P.{mr,<T<m} 
u — u in  u u — 


u -d/2  f0 

2[1  -9(b)  ] + (2/tt;  b e U 


[v(x)/x]dx 


Although  the  first  term  on  the  right  hand  side  of  (A. 7)  is  of  smaller 
order  of  magnitude  than  the  second,  and  hence  there  is  no  good 
mathematical  reason  for  including  it,  its  inclusion  seems  to  give  more 
accurate  numerical  results  when  compared  with  simulations 
(cf.  equation  (50)  of  Siegmund,  1977). 

It  is  easy  to  compute  the  function  v(x)  defined  in  (A. 6) 
numerically.  Woodroofe  (1978b)  has  tabled  an  integral  equivalent  to 
that  in  (A. 5).  Siegmund  (1979)  has  shown  that  for  x less  than  about  2, 
v(x)  is  well  approximated  by  exp(-.583x),  and  hence  for  many  values  of 
0q  and  0^  the  integrals  in  (A. 5)  and  (A. 7)  may  be  obtained  from  tables 
of  the  exponential  integral  and  the  value 


/2  [v(x)/x]dx  = .224 


Let  T+  be  defined  by  (A.l)  with  s^  in  place  of  | | . According 

It 

to  Siegmund  (1978),  if  tr*®,  nr*10,  and  b = m 0^,  for  each  fixed  0 > 0,  x>0 

(A. 8)  Pq{T  < m,  s < bm^  - x)  = v(0,  )<p  [m2(01  - 0)  ]e  ^/m*5  0 

u + m il 

For  Brownian  motion  the  corresponding  approximation  has  1 in  place  of 

V(0X) . Together  with  the  obvious  decomposition 

u u 

P„{T,  < m)  = PQ(s  >bm1}  + P0{T,  <m,  s <bi  } , 

OH — am  o+  m 

(A. 8)  allows  one  to  approximate  Pg{T<^m)  for  0^0. 
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APPENDIX  B 


APPROXIMATE  EXPECTED  SAMPLE  SIZE  FOR  NORMAL  DATA 

Let  (x(t) , 0<^t<°°}  be  a Brownian  motion  process  with  drift  9 
and  variance  1 per  unit  time.  Let  T = inf{t  : t>mQ,  |x(t)|  = bt^} . It 
may  be  shown  (e.g.,  Siegmund,  1977)  that  for  each  9/0,  as  b -*■  00 

(B.l)  E0T  = (b2  - l)/92  + o(l)  . 

For  discrete  normal  random  walk  the  corresponding  expansion  contains  a 
term  to  account  for  excess  over  the  stopping  boundary,  which  can  be 
computed  numerically  and  for  small  9 is  about 

(B. 2)  1.166/9 

(cf.  Lai  and  Siegmund,  1979). 

For  sequential  tests  of  the  kind  discussed  in  this  paper  the 
expected  sample  size  is 

(B • 3)  Eq  min(T,m)  = Efl  T - / EQ  (T  - m| X(m) ) dPQ  . 

9 9 (T>m)  9 9 

h 

Suppose  that  b -*■  00  and  m 00  in  such  a way  that  b = 0^m  . For  9 in  a 
neighborhood  of  0^,  say  0 = 0^  + £m  2,  it  is  possible  to  estimate  the 
second  term  on  the  right  hand  side  of  (B.3)  to  provide  reasonable 
approximations  to  E0min(T,m). 

u 

Theorem.  Suppose  b -*■  «>  and  m -*■  00  so  that  for  some  9^  / 0,  b = ©^m  . 
For  0 = 0^  + Cm  ^ 
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(B.4)  E0min(T,m)  = (b2  - l)/02  - (m*5  [0  [$(£)  - K *(-  O ]) 


+ ei2[$(-0(l  + ?2)  -5*(0]  + o(l) 


A sketch  of  a proof  goes  as  follows.  By  (B.l)  it  suffices  to 
consider  the  second  term  on  the  right  hand  side  of  (B.3),  which  may  be 
rewritten  as 


(B . 5)  /q  P0{X(m)  € 9^m  - dx)  (1  - Pq(t  < m|  X(m)  = 0^  m - x})  Eg  T (x)  , 


where 


T (x)  = inf{t  : t > 0,  X(t)  =0,  m*5  [(m  + t)*5  - m*5]  + x) 
m 1 

i<  u i 

Since  0^m  [(m+t)  -m  ] < -^  8^t,  a standard  argument  using  Wald's 
identity  yields 


(B . 6) 


vm(x) 


1/8 

Writing  the  integral  in  (B.5)  as  the  sum  of  integrals  over  (0,m  ), 

1/8  1/2  1/2 

(m  , m log  m)  and  (m  log  m,  00 ) , one  sees  from  (B.6)  that  the 

first  and  third  integrals  converge  to  0 as  m ■+■  00 . It  may  also  be  shown 

1/8 

as  in  Siegmund  (1977)  that  uniformly  in  x > m , 

Pq(t  < m| X(m)  = 0^  m - x)  = o(m  1)  , and  hence  by  (B.5)  and  (B.6)  it 
suffices  to  find  an  approximation  for 


(B.  7) 


1/2  . 

-1/2  (m  1°8 m i/2  -1/2 

n 4> (m  (0.  -0)  - xm  )EQ  T (x)dx 

1/8  1 9 m 

m 


A Taylor  series  expansion  and  some  calculation  with  Wald's  identity 

h 

shows  that  uniformly  for  x < m log  m 

E0  Tm(X>  = x/(0  ~^®1^  “ x2/m0^  + o(x2/m)  , 

which  when  substituted  into  (B.7)  yields  the  theorem. 

For  the  entries  in  Table  1,  the  quantity  (B.2)  was  added  to 

(B.4)  to  obtain  a slightly  better  approximation  to  EQT.  It  seems 

doubtful  that  estimating  the  excess  over  the  boundary  in  the  correction 

term  . E0(T-m|s  )dPQ  is  worth  the  effort,  since  this  term  is 

tl>m;  o m 

already  relatively  small  in  those  cases  where  the  over-all  approxima- 
tion can  be  expected  to  be  accurate.  The  entries  in  Table  2 were 
obtained  from  the  Brownian  motion  approximation  with  mean  and  variance 
given  by  (5)  and  (6) . 


APPENDIX  C 

APPROXIMATE  SIGNIFICANCE  LEVEL  FOR  BERNOULLI  DATA 


Let  xn,  y^,  H,  I,  and  8,^  be  as  in  Section  3.  The  stopping  rule 


T defined  by  (4)  may  be  rewritten 


(C.l) 


T = first  n > m_  such  that  l > a , 
— 0 n 


where  a = b / 2.  The  significance  level  of  the  sequential  test  studied 
in  Section  3 is  given  by  (7) . In  this  appendix  an  asymptotic  expres- 
sion similar  to  (A. 5)  is  obtained  for  P,  .(Km)  asm  + l»l  m.  +®, 

(p,p)  - ’O’ 

b -*•  00  in  such  a way  that  b = m 0^  = m^0Q.  The  method  utilizes  the 

non-linear  renewal  theorem  and  an  interesting  adaptation  of  the  methods 

of  Lai  and  Siegmund  (1977).  Since  the  computations  are  rather 

elaborate,  they  are  only  given  heuristically . The  following  likelihood 

ratio  identity  is  also  very  helpful  in  simulating  a. 

Let  Fj  c F^  c , , . be  an  increasing  sequence  of  sub-<J-algebras 

of  a basic  a-algebra  F.  Let  P and  Q be  two  probabilities  on  F such 

that  the  restriction  P^  of  P to  F^  is  absolutely  continuous  relative 

to  the  corresponding  restriction  of  Q.  Let  L = dP^/dQ^  be 

n 

the  likelihood  ratio  of  these  restrictions.  One  version  of  the  funda- 
mental identity  of  sequential  analysis  says  that  for  any  stopping  time 
a and  any  event  A such  that  A H (a  = n)  € F for  all  n. 


(C.2) 


P(Afl  (a  <<*>})  = f L dQ  . 

An{a<°°} 


(The  proof  follows  at  once  by  writing  {a  < °°}  *=  U (a  = n)  and  using  the 

n=l 

additivity  of  the  integral.) 
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In  what  follows  P,  , will  be  as  in  Section  3 and 
(P1»P2> 

Q ■ /01/01  P(Pl>p2)  d|,l  dp2  • 


Taking  P - P(pip)  gives 


<C-3)  Ln  ’ <!p/d0<”)  ' (s"J  (s°)  PS,,  + S”  P2"'80'8"  , 


where  s^  = and  s^  = l”  y^.  The  identity  (C.2)  gives  representa- 
tions for  P,  .{T<m}  and  P,  ,.{T>m,  (25,  )^>c)  which  are  useful  in 

estimating  these  probabilities  by  Monte  Carlo  methods.  One  samples 
(Xi,yi) , ^ . according  to  Q and  estimates  P^  ^^{T<^m}  , for 

example,  by  averages  of  L^.  (See  Siegmund,  1975,  for  a general 

discussion  of  such  importance  sampling  in  sequential  analysis  and  Lai 
and  Siegmund,  1977,  for  an  application  in  a context  similar  to  the  pre- 
sent one.)  This  estimator  has  three  advantages  over  direct  simulation: 
(i)  its  variance  is  smaller;  (ii)  the  expectation  under  Q of  min(T,m) 
is  smaller  than  under  P^  ^ where  it  essentially  equals  the  maximum 
sample  size  m;  and  (iii)  P ^ ^{T£m}  may  be  estimated  simultaneously 
for  several  values  of  p using  the  same  random  numbers.  For  estimating 
a(p)  for  a test  with  c small  compared  to  b this  technique  is  not 
variance  reducing,  but  advantages  (ii)  and  (iii)  hold  in  this  case  as 


In  contrast  to  the  case  of  normal  variables,  where  a direct 
representation  of  the  probability  that  T <_  m by  means  of  (C.2)  provided 
the  starting  point  of  a fruitful  asymptotic  analysis,  in  this  case  an 
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indirect  approach  seems  advisable.  Let  u > 0,  v > 0,  and  let 


(C.  4) 


P = /0  P(C.O  dC/B(u+l,  v+1) 


so 


(C. 5)  Ln  = dP(n)/dQ(n)  = (n+1)2  J’gn^"J/B(irfl,v+l)(u+v+2n+l)^U+V+2^  . 


Of  course,  P defined  by  (C.4)  depends  on  u and  v.  If  u + ” and  v -*■  00 
in  such  a way  that  u/(u  + v)  -*■  p,  then  the  distribution  with  density 


CU(l-C)V/B(u+l,  v+1)  converges  weakly  to  a point  mass  at  p,  so  for 


each  fixed  m 


(C.6) 


P(T<  m}  -*•  P,  .{T<  m}  . 
~ (P,P)  ~ 


Hence  for  large  u and  v with  u/(u  + v)  = p an  approximation  for  P{T£m) 


"should  be"  an  approximation  for  P^  ^^{T^m}. 


Fix  0 < P^»P2  < Stirling's  formula  and  the  strong  law  of 


large  numbers  applied  to  (C.3)  show  that  with  P ^ ^ ^ probability  one. 


as  n -*■  “ 


(C  .7)  log  Ln  = - Jln  + -|logn  + log  p + u log  p + v log  q 


- log  2 TT5  - logB(u+l,  v+1)  + o(l)  , 


^ 2 

where  p = _2^P2  + P2^  an^  9 = 1”P*  Suppose  now  that  a = b /2  00 , 


m^  -*•  °°,  and  m -*■  00  in  such  a way  that  b = 9^m  = G^m^.  Substitution  of 


(C.7)  into  (C.2)  and  an  argument  similar  to  that  of  Lai  and  Siegmund 
(1977)  yields 
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, 


(C.8)  P(t  < m} 


~ -|[B(u+l,  v + 1)  ] 1(tt  1a)is  e 3 

* /()1  /()1  (Km)6  T (T/a)l5(^^/PlqlP2q2)l5dP(p1,P2)iVdpldp2 


- |[B(u  + l,  v+l)]"1(Tr"1a)ii  e'a 

* f f v(p  p )[l(p  ,p  )]  ^(p  q/p.q.p.q.)'5  p^  dp  dp., 

t (Pl»P2)  :01<  [2I(p1,P2)],S<  9q} 

where 

V(p.,p9)  = lira  E-  .exp[-(£  -a)] 

1 2 a-+°°  (pl*p2)  T 


exists  by  an  application  of  Theorem  1 of  Lai  and  Siegmund  (1977). 
(Actually,  in  order  that  this  theorem  be  applicable  it  is  necessary 
that  a certain  random  walk  associated  with  the  process  8,  be  non- 
arithmetic, which  is  the  case  for  all  p^,p2  with  at  most  a denumerable 
number  of  exceptions.  This  suffices  in  view  of  the  subsequent  integra- 
tion over  p^  and  p2») 

Now  suppose  that  u *►  00  and  v -►  00  with  u/(u  + v)  » p <_  -j.  Some 
calculation  shows  that  the  measure  v(dp^,dp2)  = 

[B(u+1,  v+1)]  ^ p11  dp^dp2  has  total  mass  converging  to  4p  and 

1 \ a 

converges  weakly  to  a measure  uniform  on  y(p^  + p2)  = p.  Hence,  a e 
times  the  right  hand  side  of  (C.8)  converges  to 
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(C . 9)  tt_!s  C(C,2p-C)[l(C,2p-C)]'*i  fpq/C(l-C)(2p-C)(2q-l+e)]J4  dC  . 

(0,2p)  0(5:0^  [2I(C,2p-C)]Js<  0O) 

Together  with  (C.6)  this  suggests  that 

(C.10)  P(pjp){T<m)  - C(p;0(),01)als  e"a  (0<p<-|)  , 

where  C(p;0Q,©1)  denotes  the  expression  in  (C.9).  For  y <_  p < 1,  a 
similar  result  holds  with  C(q;0g,0^)  in  place  of  C(p;0Q,0^). 

It  should  be  emphasized  that  the  preceding  argument  is  only 
heuristic,  although  it  seems  to  be  possible  to  make  it  rigorous  by 
taking  u and  v as  functions  of  m which  tend  to  00  slowly  with  m.  The 
final  result  appears  to  agree  formally  with  a similar  very  general 
result  of  Woodroofe  (1978a),  which  however,  does  not  apply  in  this  case 
because  certain  smoothness  conditions  important  to  Woodroofe' s method 
are  not  satisfied. 
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